A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations
نویسندگان
چکیده
The solution of linear systems equations is a central task in number scientific and engineering applications. In many cases the may take most simulation time thus representing major bottleneck further development technical software. For large scale simulations, nowadays accounting for several millions or even billions unknowns, it quite common to resort preconditioned iterative solvers exploiting their low memory requirements and, at least potential, parallelism. Approximate inverses have been shown be robust effective preconditioners various contexts. this work, we show how adaptive Factored Sparse Inverse (aFSAI), characterized by very high degree parallelism, can successfully implemented on distributed computer equipped with GPU accelerators. Taking advantage GPUs FSAI set-up not trivial task, nevertheless through an extensive numerical experimentation proposed approach outperforms more traditional results close-to-ideal behavior challenging algebra problems.
منابع مشابه
Random number generators for massively parallel simulations on GPU
High-performance streams of (pseudo) random numbers are crucial for the efficient implementation for countless stochastic algorithms, most importantly, Monte Carlo simulations and molecular dynamics simulations with stochastic thermostats. A number of implementations of random number generators has been discussed for GPU platforms before and some generators are even included in the CUDA support...
متن کاملA GPU-Accelerated Parallel Preconditioner for the Solution of the Boltzmann Transport Equation for Semiconductors
The solution of large systems of linear equations is typically achieved by iterative methods. The rate of convergence of these methods can be substantially improved by the use of preconditioners, which can be either applied in a black-box fashion to the linear system, or exploit properties specific to the underlying problem for maximum efficiency. However, with the shift towards multiand many-c...
متن کاملA massively parallel GPU-accelerated model for analysis of fully nonlinear free surface waves
We implement and evaluate a massively parallel and scalable algorithm based on a multigrid preconditioned Defect Correction method for the simulation of fully nonlinear free surface flows. The simulations are based on a potential model that describes wave propagation over uneven bottoms in three space dimensions and is useful for fast analysis and prediction purposes in coastal and offshore eng...
متن کاملMassively Parallel A* Search on a GPU
A* search is a fundamental topic in artificial intelligence. Recently, the general purpose computation on graphics processing units (GPGPU) has been widely used to accelerate numerous computational tasks. In this paper, we propose the first parallel variant of the A* search algorithm such that the search process of an agent can be accelerated by a single GPU processor in a massively parallel fa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of High Performance Computing Applications
سال: 2021
ISSN: ['1741-2846', '1094-3420']
DOI: https://doi.org/10.1177/10943420211017188